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Abstract 

Background: In approximately 80% of patients, ovarian cancer is diagnosed when the patient is already in the 
advanced stages of the disease. CA125 is currently used as the marker for ovarian cancer; however, it lacks 
specificity and sensitivity for detecting early stage disease. There is a critical unmet need for sensitive and specific 
routine screening tests for early diagnosis that can reduce ovarian cancer lethality by reliably detecting the disease 
at its earliest and treatable stages. 

Results: In this study, we investigated the N-linked sialylated glycopeptides in serum samples from healthy and 
ovarian cancer patients using Lectin-directed Tandem Labeling (LTL) and iTRAQ quantitative proteomics methods. 
We identified 45 N-linked sialylated glycopeptides containing 46 glycosylation sites. Among those, ten sialylated 
glycopeptides were significantly up-regulated in ovarian cancer patients' serum samples. LC-MS/MS analysis of the 
non-glycosylated peptides from the same samples, western blot data using lectin enriched glycoproteins of various 
ovarian cancer type samples, and PNGase F (+/-) treatment confirmed the sialylation changes in the ovarian cancer 
samples. 

Conclusion: Herein, we demonstrated that several proteins are aberrantly sialylated in N-linked glycopeptides in 
ovarian cancer and detection of glycopeptides with abnormal sialylation changes may have the potential to serve 
as biomarkers for ovarian cancer. 

Keywords: Ovarian cancer, Quantitative proteomics, Sialylation, Lectin, N-linked glycopeptides, Mass spectrometry, 
Western blot 



Background 

The American Cancer Society estimates that in 2011, 
about 21,990 new cases of ovarian cancer will be diag- 
nosed and 15,460 women will die of ovarian cancer in 
the United States (ovariancancer.org) [1-3]. When ovar- 
ian cancer is detected early, the five year survival rate is 
over 90% [4]. Serum measurement of CA125, the 
current standard, has an early stage detection rate of 
only about 28% and when combined with ultrasound 
still only identifies 48% [5,6]. Development of improved 
diagnostic tools for early detection of ovarian cancer, in- 
cluding the discovery of new ovarian cancer biomarkers, 
has the potential to significantly improve the survival 
rate. 
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It has been shown that in the cancer transformation 
process, changed expression and post translational 
modification of proteins occurs, resulting in a change in 
the protein structure and function. Investigating these 
modifications specific for cancer may provide vital infor- 
mation and serve as biomarkers for the diseased state. 
Glycosylation is a common and essential form of post 
translational modification of proteins. Among all the gly- 
cosylation forms, sialylation has received much attention 
owing to the strong correlation between the sialylation 
aberration and cancer [7]. Sialic acid residues are known 
to be linked via an R-2,3 or an R-2,6 bond to Gal/GalNAc 
in proteins. SNA lectin binds to peptides carrying a sialic 
acid residue connected to the underlying sugar chains 
through an R-2,6 linkage. It has been suggested that there 
is an increased branching of glycan structures in cancer 
along with the increased expression both at RNA and 
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protein level of sialyltransferase [8-10], which leads to a 
global increase in sialyation of the proteins [11]. Increased 
activity of sialyltransferase is also shown to be accom- 
panied by an increase in the level of enzymes, such as 
ST6Gal-l, which is responsible for linking sialic acid to 
galactose in colorectal, ovarian and breast cancers 
[10,12-14]. ST6Gal-l has been implicated in cell-cell 
interaction, enhanced motility and increased invasive- 
ness of tumor cells [10]. 

In the last decade, with the evolution of proteomics 
and glycomics technologies, the potential for the identi- 
fication of biomarkers has increased tremendously, in 
spite of the extreme complexity of the serum with a dy- 
namic range in concentration of several orders of magni- 
tude [15]. In order to identify these low abundant 
disease marker proteins in serum, various methods have 
been developed to deplete the abundant proteins such as 
albumin and IgG, which constitutes about 90% of the 
serum protein concentration [16,17]. Alternatively, sev- 
eral methods were developed to enrich a specific class of 
proteins, such as glycoproteins by a lectin affinity en- 
richment strategy, which can increase the chances for 
the identification of elusive glycosylation changes in low 
abundant proteins. Recent advances in glycoproteomics 
have made it possible to probe specific glycosylation 
changes [18], in particular sialylation changes [19-21], in 
proteins between the disease and normal state. The level 
of sialic acid was observed to be significantly elevated in 
ovarian cancer patients plasma compared to the healthy 
controls [19,20]. Berbec et al. [21], reported that the 
average concentration of sialic acid in total serum in 
ovarian cancer patients was significantly higher than in 
the healthy control group and may reflect the develop- 
ment of malignancy and should be considered as a sup- 
porting tumor marker in ovarian cancer diagnosis. In 
recent years, several groups have investigated the sialyla- 
tion aberration in the glycoproteome of cancer serum 
samples using diverse proteomics strategies including 
lectin affinity, hydrazine chemistry, HPLC and chemical 
enrichment methods [18,22-27]. 

In our previous work, we probed the prostate cancer 
serum glycoproteome by employing the lectin-directed 
tandem labeling (LTL) quantitative proteomics method 
[28] and identified several N-linked sialylated glycopep- 
tides that showed significant sialylation aberration be- 
tween normal and prostate cancer serum samples. In the 
current study, we report the results of the sialylation ab- 
erration analysis in ovarian cancer-associated N-linked 
glycoproteins. We employed the LTL method to identify 
N-linked sialylation sites and accurately identified the 
changes in sialylation between normal and ovarian can- 
cer serum samples based on the N-deglycosylated pep- 
tide analysis [29,30]. We used SNA lectin to capture 
sialylated glycopeptides and, for quantitation, we used 



acetyl ( 1 H 3 / 2 D 3 ) labeling at the N-terminus in combin- 
ation with ls O labeling during PNGase F digestion for 
glycosylation site mapping. Further, iTRAQ quantitative 
analysis of non-glycosylated peptides from the same 
samples revealed that the observed sialylation changes in 
cancer serum samples are independent of the glycopro- 
tein concentrations. The quantitative proteomics results 
were further verified by western blot analysis of SNA 
enriched selected glycoproteins that are strongly impli- 
cated in ovarian cancer. 

Results 

Identification of glycopeptides and determination of 
glycosylation sites 

The glycopeptide identification and glycosylation site de- 
termination were achieved by following the strategy as 
outlined in Figure 1. First, abundant IgG glycoproteins 
were depleted from normal and ovarian cancer patients' 
serum samples. Then equal amounts (5 mg each) of pro- 
tein from normal and cancer serum samples were digested 
by trypsin and glycopeptides containing sialic acid were 
enriched by using SNA lectin. The resulting sialylated gly- 
copeptides were labeled using light and heavy isotopes of 
acetic anhydride reagents and mixed. Next, N-linked gly- 
cans were cleaved by PNGase F in the presence of heavy 
( ls O) water to introduce a 3 Da mass shift with the aim 
to unequivocally identify glycosylation sites. Finally, the 
N-deglycosylated peptide mixture was analyzed by nano 
LC-MS/MS to identify peptide sequences in addition to 
determining the glycosylation sites. Each glycopeptide 
identified in the database search results was inspected for 
the NXS/T consensus sequence as well as for a 3 Da mass 
shift. Also, the MS/MS spectrum of each glycopeptide was 
verified manually and unambiguously characterized the 
complete peptide sequence. For instance, two glycopep- 
tides (WLHPNYSQVDIGLIK and NLFLNHSENATAK) 
were identified for haptoglobin and the tandem mass 
spectrometry data in Figure 2b shows all the signature 
ions (b and y ions) confirming the sequence of the light 
isotope of CH 3 CO-WLHPD( 18 0)YSQVDIGLIK-OCCH 3 
glycopeptide sequence. Similarly, in the MS/MS spectrum 
of Figure 2a, all b and y ions, including a shift of 3 Da 
modification in b ions (b 13 , b 14 , b 15 ) and y ions (y 2 , y 4 , y 5 ) 
due to heavy acetyl group at the N-terminus, confirm the 
identity of the heavy isotope of the CD 3 CO-VVLHPD 
( 18 0)YSQVDIGLIK-OCCD 3 glycopeptide. It should be 
noted that the above two peptides possess NXT/S consen- 
sus sequences and, in both peptides, asparagine is modi- 
fied to aspartic acid, as evidenced by 3 Da mass shift for 
b 9 ion in Figure 2a and 2b. However, the total mass is 
increased by 6 Da from light labeled peptide to heavy la- 
beled peptide because of the acetylation labeling at the N- 
terminus as well as at the c-terminal lysine side chain. The 
other haptoglobin glycopeptide, NLFLNHSENATAK, was 
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Figure 1 LTL quantitative proteomics strategy used for the identification and quantification of sialylated N-linked glycopeptides in 
normal and ovarian cancer sera. 
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Figure 2 Tandem mass spectra of doubly charged ions of heavy CD 3 C0- VVLHPD( 18 0)YSQVDIGLIK- C0CD 3 and CH 3 C0- VVLHPD( 18 0) 
YSQVDIGLIK- 0CCH 3 peptides as identified by LTL quantitative proteomics in normal and ovarian cancer sera analysis. 
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identified with 2 glycosylation sites and this is the only 
glycopeptide identified with more than one glycosylation 
site, as corroborated by its light and heavy labeled peptide 
tandem mass spectrometry data shown in Figure 3a and 
3b, respectively. Similar observations were made for 
CD 3 CO-HAD( 18 0)WTLTPLK (PON1) and CD 3 CO 
DIVEYYnDSD( 18 0)GSHVLQGR (alpha-2-glycoprotein 1, 
zinc) glycopeptides as confirmed by their MS/MS data 
(Figure 4). For these two proteins, the light labeled ver- 
sions of the peptides were not identified in our analysis. 
The total number of glycopeptides identified in the 
current study and the relevant details are given in Table 1. 
In total, we identified 45 glycopeptides derived from 30 
sialylated glycoproteins. Overall, we were able to identify 
46 glycosylation sites in 45 N-linked glycopeptides and all 
these sites have been reported in the literature (Swissprot 
database). The majority of the sialylated glycopeptides 
identified in the current study were previously reported by 
various proteomics methods signifying the strength of our 
LTL quantitative method to investigate glycosylation aber- 
rations and glycosylation sites in proteins in cancer. Fur- 
thermore, identification of low abundant serum proteins 
including PON 1 (25 jig/mL) [31], Ficolin-3 (32.4 jig/mL) 
[32], and Kallikrein (2.9 (ig/mL) [33] in our analysis dem- 
onstrate the high sensitivity of the LTL method. 

Quantitative analysis 

We investigated the differences in sialylated glycopep- 
tides in normal and ovarian cancer serum samples by 



quantitating the N-deglycosylated peptides as described 
in the experimental procedure. Table 1 summarizes the 
results of the quantification of N-deglycosylated peptides 
and the corresponding proteins. As evident in Table 1, 
quantitative ratios are very different and vary signifi- 
cantly between different peptide sequences of each pro- 
tein suggesting prominent differences at the site of 
sialylation. For example, the sialylation of glycopeptides 
WLHPNYSQVDIGLIK (haptoglobin), NLFLNHSENA- 
TAK (haptoglobin), HANWTLTPLK (PON1) and 
DIVEYYNDSNGSHVLQGR (alpha-2-glycoprotein 1, 
zinc) are increased by 2.7 fold, 2.0 fold, 4 fold and 2 fold, 
respectively, as confirmed by the XIC s of their light and 
heavy isotopes (Figure 5). Similarly, out of 30 glycopro- 
teins, sialylation is increased in 10 proteins that are iden- 
tified by multiple N-deglycosylated peptides in ovarian 
cancer serum samples. However, sialylation is not 
increased in all the glycopeptides from these up- 
regulated glycoproteins and. interestingly; we were also 
able to identify differential sialylation in multiple glyco- 
peptides from a single protein. For example, three N 
linked glycopeptides (EHEGAIYPDNTTDFQR - 3 fold 
increase, ENLTAPGSDSAVFFEQGTTR- 1.9 fold in- 
crease, AGLQAFFQVQECNK - and 2.2 fold decrease) 
were identified in ceruloplasmin protein with differential 
sialylation (Table 1). In contrast, we identified only one 
glycopeptide, NTTCQDLQIEVTVK with a 3 fold de- 
crease in sialylation and no change was observed in the 
other glycopeptide, GLNVTLSSTGR, from Complement 
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Figure 3 Tandem mass spectra of doubly charged ions of heavy CD 3 CO- NLFL D( 18 0)HSED( 18 0)ATAK- COCD 3 and light CH 3 CO- NLFL D 
( 18 0)HSED( 18 0)ATAK- OCCH 3 peptides as identified by LTL quantitative proteomics in normal and ovarian cancer sera analysis. 
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Figure 4 Tandem mass spectrum of doubly charged ions of (a) heavy CD 3 CO- HA D( 18 0)WTLTPLK peptide and (b) CD 3 CO-DIVEYYNDSD 
( 18 0)GSHVLQGR. Peptide as identified by LTL quantitative proteomics in normal and ovarian cancer sera analysis. 



C4-A protein. Out of 45 glycopeptides identified, sialyla- 
tion was increased in 10 peptides and decreased in 2 
peptides by more than 2 fold in each case. Standard de- 
viation in the current ovarian cancer glycopeptide ana- 
lysis was assumed to be 9% based on our previous 
triplicate analysis of Fetuin N-linked glycopeptides using 
the LTL quantitative proteomics method [28]. 

In order to determine whether the changes in sialy- 
lated glycopeptides are a result of the alterations in the 
level of sialic acid at the identified site or at the concen- 
tration of the parent glycoprotein levels, we performed 
quantitative analysis of the non-glycosylated peptides 
obtained from the same samples. These peptides were 
obtained from the flow- through peptide mixtures of the 
SNA lectin enrichment step of normal and cancer serum 
samples. The non-glycosylated peptides were further 
purified, labeled by iTRAQ reagents (in triplicate), frac- 
tionated by SCX chromatography and analyzed by LC- 
MS/MS experiments (in triplicate) using Orbitrap MS. 
Quantitative analysis was performed using proteome dis- 
coverer software. We identified more than 100 serum 
proteins out of which 28 glycoproteins were identified in 
the N-deglycosylated peptide analysis using the LTL 
method (Table 2). The concentration remained 



unchanged for all the 28 proteins between normal and 
ovarian cancer serum samples with overall heavy to light 
ratios ranged from 0.8 to 1.5. These 28 proteins also in- 
clude the 9 glycoproteins identified by LTL method in 
which the sialylation, as analyzed by the glycopeptides, is 
increased (indicated with asterisk in Table 1). This sug- 
gests that sialylation indeed increased in ovarian cancer 
for the 10 N-linked glycopeptides corresponding to the 
9 glycoproteins identified by the LTL method and it is 
independent of their protein concentration. For the 
remaining 18 proteins, we observed no change in sialyla- 
tion or protein concentration levels. 

Western blot analysis 

Validation of differentially sialylated glycoproteins 

To further verify our quantitative results obtained by LTL 
and iTRAQ methods, western blot analysis was performed 
using specific antibodies targeting the selected serum gly- 
coproteins. We selected haptoglobin and PON1 [34], 
based on the implication of their glycosylation, in particu- 
lar the sialylation aberration in ovarian cancer and other 
cancers. Zinc- alpha-2 -glycoprotein was chosen due to its 
significant biomarker potential for various cancer indica- 
tions [35]. Western blot analysis was performed using 



Table 1 List of identified glycosylation sites and the results of quantitation of N-deglycosylated peptides 
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Table 1 List of identified glycosylation sites and the results of quantitation of N-deglycosylated peptides (Continued) 

Proteins with no change in sialylation 

P02763 Alpha-1 -acid glycoprotein 1 

P19652 Alpha-1 -acid glycoprotein 2 
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90 



1637.20 1636.74 0.46 58 



1607.73 1606.86 0.87 74 



1240.06 1239.57 0.50 66 



1.8 

1.0 

1.3 

1.3 
1.1 
1.6 

1.8 

1.1 

1.4 

1.1 
1.4 

1.0 
1.9 

1.8 

1.6 
1.1 



1.0 



1.3 



1.4 



1.1 



1.4 



0.8 



1.4 



3 
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"D — 
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II 

is 
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Table 1 List of identified glycosylation sites and the results of quantitation of N-deglycosylated peptides (Continued) 



Q08380 


Galectin-3-binding protein 




















1.5 






ALGFENATQALGR 


69 


2 


696.82 


1391.63 


1391.69 


-0.06 


66 


1.5 




P05546 


Heparin cofactor 2 




















1.7 






NLSMPLLPADFHK 


49 


2 


785.62 


1569.22 


1568.81 


0.41 


60 


1.7 




P 19823 


Inter-alpha-trypsin inhibitor 
heavy chain H2 




















1.0 






GAFISNFSM7VDGK 


118 


2 


781.10 


1560.18 


1559.70 


0.48 


93 


1.0 




P03952 


Kallikrein 




















1.2 






IYSGILNLSDITK 


453 


2 


762.29 


1522.57 


1522.80 


-0.23 


90 


1.2 




P01042 


Kininogen-1 




















1.2 






LNAENNATFYFK 


294 


2 


740.53 


1479.05 


1478.70 


0.35 


80 


1.2 








I^SIVQTNCSK 


205 


2 


731.20 


1460.38 


1460.71 


-0.33 


64 


1.6 








YNSQNQSNNQFVLYR 


48 


2 


967.71 


1933.40 


1932.88 


0.52 


0.52 


0.9 




P05155 


Plasma protease CI inhibitor 




















1.3 






DTFVNASR 


238 


2 


479.10 


956.18 


956.45 


-0.27 


38 


1.3 




P00734 


Prothrombin 




















1.8 






NFTENDLLVR 


416 


2 


634.62 


1267.22 


1267.63 


-0.41 


45 


1.8 




P04004 


Vitronectin 




















1.2 






NGSLFAFR 


169 


2 


485.67 


969.32 


969.48 


-0.16 


53 


1.2 




Proteins with dicrease in sialylation 






















P0C0L4 


Complement C4-A 




















0.7 






GLNVTLSSTGR 


1335 


2 


575.20 


1148.39 


1148.59 


-0.20 


72 


1.2 








NTTCQDLQIE\m/K 


1391 


2 


875.54 


1 749.07 


1 748.83 


0.24 


72 


0.3 





List of identified glycosylation sites and the results of quantitation of N-deglycosylated peptides obtained from normal and ovarian cancer sera by using LTL quantitative proteomics method. 
* The RSD of this method is 9% as determined by the reproducibility studies with fetuin [28]. 

+ * The ratio change in the glycopeptides of these proteins is due to increase in sialylation and not due to the increase in protein concentration. 
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WLH PN YSQVDIGLIK 
Haptoglobin 



2.7 




NLFLNHSEMATAK 
Haptoglobin 




i i i i 

33 



i i i i i i i i i i i i i i i i i i i i i 
34 35 36 37 

Time (min) 



HAMWTLTPLK 
PON1 




41 



42 



DIVEYYN DSNGSH VLQGR 
Zinc-alpha-2-glycoproftein 




61.0 



61.5 62.0 
Tine (mil) 



62.5 



Figure 5 The XICs and quantitative ratios of light and heavy deglycosylated peptides identified in the LTL proteomics analysis of 
ovarian cancer serum. These ratios were obtained by calculating the peak areas in the XICs of precursor peptides using Xcalibur software. 



proteins obtained before and after the lectin enrichment 
of normal and cancer serum samples (Figure 6). The data 
indicates that before SNA enrichment, there is no signifi- 
cant difference in the levels of haptoglobin and zinc- 
alpha-2-glycoprotein between the normal and cancer 
serum samples, whereas cancer serum contained lower 
levels of the PON1 glycoprotein. The western data is in 
agreement with the non-glycosylated peptide analysis, 
which showed no difference between normal and cancer 
samples at the protein level. However, after SNA lec- 
tin enrichment, haptoglobin, PON1 and zinc-alpha-2- 
glycoproteins show a significant increase in protein 
concentration and their levels are comparable to the 
deglycosylated peptide H/L ratios (Figure 6a) indicating 
that the sialylation is indeed increased in these proteins 
in ovarian cancer. Overall, the western blot data corro- 
borates the LTL quantitative ratios of the deglycosylated 
peptides. The minor differences between the western 
blot data and the glycopeptide quantitation results may 
be attributed to the contribution of the O-linked sialy- 
lated fraction of each protein which is further investi- 
gated as described below. 



Individual patient sample analysis 

In order to ascertain the effect of pooling of the patient 
samples on the sialylation differences between normal 
and cancer, we have performed western blot analysis of 
individual samples obtained from serous, endometroid 
and clear cell carcinoma of stage I and Stage II-IV ovarian 
cancer patients. These samples were subjected to SNA 
lectin enrichment and probed with PON1 and haptoglo- 
bin antibodies. We also ran the pooled stage I and stage 
II-IV samples and pooled healthy female samples as con- 
trols and the corresponding data is given in Figure 6b. We 
observed no significant differences between the stages and 
types of ovarian cancers. In general, there was only a small 
difference between the cancer samples and the healthy 
controls at the glycoprotein level as assessed by the 
western blot. 

Investigation of O-sialylated glycoprotein contribution 

We have also carried out PNGase (+/-) experiments after 
SNA enrichment of intact glycoproteins and western blot 
analysis using haptoglobin specific antibodies. As illu- 
strated in Figure 6c, multiple bands were identified in the 
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Table 2 List of glycoproteins identified and their iTRAQ quantitative ratios obtained from the triplicate analysis of 
nonglycosylated peptides of normal and ovarian cancer sera 



Swissprot 
ID 


Description 


# 

Peptides 


Sequence 
coverage 


Protein ratio (Cancer/Normal) 
(Mean±Stdev) 


P01011 


Alpha-1 -antichymotrypsin 


6 


13.2 


1 .5±0.03 


P01008 


Antithrombin-lll 


7 


14.4 


1 .5+0.07 


P02763 


Alpha-1 -acid glycoprotein 1 


4 


17.4 


13+0.14 


P04114 


Apolipoprotein B-100 


63 


12.8 


1 .3±0.09 


P00748 


Coagulation factor XII 


3 


4.1 


1.3+0.39 


P19652 


Alpha-1 -acid glycoprotein 2 


3 


12.9 


1.3+0.17 


P01009 


Alpha-1 -antitrypsin 


12 


26.3 


1 .2+0.23 


P00738 


Haptoglobin 


12 


26.4 


1 .2+0.02 


P00450 


Ceruloplasmin 


11 


10.6 


1 .2+0.02 


P25311 


Zinc-alpha-2-glycoprotein 


5 


17.1 


1.1+0.02 


P05155 


Plasma protease CI inhibitor 


3 


5.6 


1.1+0.03 


P36980 


Complement factor H-related protein 2 


3 


11.1 


1+0.20 


P02790 


Hemopexin 


7 


15.2 


1+0.04 


P0C0L4 


Complement C4-A 


26 


14.3 


1+0.05 


P00734 


Prothrombin 


8 


14.8 


1+0.04 


P08185 


Corticosteroid-binding globulin 


2 


4.0 


1+0.10 


P02749 


Beta-2-glycoprotein 1 


2 


7.5 


0.9+0.08 


P27169 


Serum paraoxonase/arylesterase 1 


5 


13.0 


0.9+0.05 


075882 


Attractin 


6 


4.3 


0.9+0.07 


P08603 


Complement factor H 


17 


15.0 


0.9+0.1 1 


P 10909 


Clusterin 


6 


11.6 


0.9+0.03 


P 19823 


Inter-alpha-trypsin inhibitor heavy chain H2 


10 


9.5 


0.8+0.09 


P03952 


Plasma kallikrein 


8 


11.9 


0.8+0.13 


P04004 


Vitronectin 


4 


7.3 


0.8+0.03 


P05546 


Heparin cofactor 2 


7 


13.2 


0.8+0.04 


P01042 


Kininogen-1 


11 


16.0 


0.8+0.06 


P02765 


Alpha-2-HS-glycoprotein 


7 


19.4 


0.8+0.03 


P02787 


Serotransferrin 


26 


41.4 


0.7+0.06 



PNGase (+) experiment from pooled normal and cancer 
(SI and SII-IV) samples. The identification of multiple 
bands may be explained due to the glycosylation hetero- 
geneity in haptoglobin and partial cleavage of N-linked 
glycans by PNGase treatment. However, only a very low 
intense band was identified at the 50 kDa region in 
PNGase (+) experiment as compared to the PNGase (-) 
experiment. This observation is true for both the normal 
and cancer (SI and SII-IV) samples, although the inten- 
sity of the 50 kDa band in PNGase (+) experiment is 
slightly higher in case of the cancer sample. This suggests 
that the amount of O-linked sialylated haptoglobin con- 
tributing to the N-linked sialylated haptoglobin in SNA 
enrichment process is very low and it may not have a 
major impact in the determination of sialylation differ- 
ences at the glycopeptides level between normal and 



cancer. Thus we have proved that the sialylation differ- 
ences in N-linked glycopeptides of haptoglobin are in- 
deed true and significant. 

Discussion 

The strategy involving the investigation of glycan specific 
changes at the glycopeptide level has many advantages 
and it is indeed important over examining the same 
changes at the protein level. It is very well established 
that the glycosylation changes are unique to a particular 
site in a glycoprotein and its association with various 
cancers. It is not possible to accurately examine these 
changes at the glycoprotein level due to the high glyco- 
protein heterogeneity and technical limitations. Enrich- 
ment of intact glycoproteins using lectins may be 
comprised by a mixture of N-linked and O-linked 
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a) 



Non-glycosylated Before SNA After SNA N-Deglycosylated 
peptide H / L ratio peptide H / L ratio 



b) 



c) 



Haptoglobin 


1.2 


^BW^ ^^^^ 


2.7 


PON1 


0.9 






4.0 


Zmc-alpha-2- 
glycoprotein 


1.1 






2.0 



2 <s • 



o o 



Sample Type 
Stage 



Serous Endometrokl Clear cell EndometrokJ 
carcinoma adenocarcinoma carcinoma carcinoma Pooled 



Pooled Pooled Pooled 
SI SII-IV Normal 



PNGase 
50kDa 



Haptoglobin 



Clear cell 

carcinoma Pooled 



Serous 
carcinoma Pooled 



Stage HV Stage IHV Normal Stage I Stage 1 Stage IUV stage HV Normal 



Haptoglobin 




SNA lectin - + -+- + -+- + _+_+_ + _+_ + 



Pon1 



Figure 6 Western blot analysis: a) Validation of differentially sialylated glycoproteins b) Investigation of O-sialylated glycoprotein 
contribution c) Pooling vs individual patient sample analysis. 



protein fractions in different ratios. This leads to ambi- 
guity in the assessment of their actual concentrations by 
validation experiments like western blot. Therefore, 
prior investigation of the dynamic glycosylation changes 
at the glycopeptide level is imperative to discover poten- 
tial cancer specific biomarkers. 

Lectin enrichment has been widely used to augment 
glycoprotein biomarker discovery in various cancers 
[7,27,28]. In order to investigate sialic acid aberrations, 
SNA lectin was used to enrich sialic acid containing glyco- 
proteins in ovarian cancer. Glycosylation differences in N- 
linked glycopeptides was analysed by finding differences 
indirectly in deglycosylated peptides without compromis- 
ing the glycan selection. Recently Ueda et al. [27], devel- 
oped a novel isotopic glycosidase elution and labeling on 
lectin-column chromatography (IGEL) method, which is 
equally useful to investigate glycosylation changes at the 
peptide level and up to 8 samples can be analyzed simul- 
taneously using iTRAQ technology. Herein, we used the 
LTL method to investigate the global sialylation changes 
in ovarian cancer on each glycoprotein via deglycosylated 
peptide analysis. There are many advantages associated 
with the LTL method [28] in probing the glycosylation 
changes at the glycopeptide level. In this study, we 
observed the consensus NXS/T sequence and an ls O tag 
in each glycopeptide identified, which significantly in- 
creased the confidence and enabled the identification of 
true positive N-glycopeptides based on the 3Da mass shift 
at the glycosylation site. This approach not only simplified 
the complexity of various glycopeptides, but also increased 
the sensitivity considerably to identify a particular 



sialylated peptide with high quantitative accuracy on the 
sialylation changes. 

Quantitative analysis of deglycosylated and non- 
glycosylated peptides revealed a significant increase in 
sialylation of the glycoproteins while the concentration 
of the glycoproteins remained unchanged in ovarian 
cancer (Table 1). Western blot analysis of SNA enriched 
selected glycoproteins further confirmed this observation 
(Figure 6). Our results are further supported by reports 
in the literature on the implication of the selected pro- 
teins in various cancers including ovarian cancer. Up- 
regulation of glycosylation in haptoglobin has been 
reported in pancreatic cancer [36-38]. Glycopeptide, 
NLFLNHSENATAK from haptoglobin has been reported 
to have de-sialylated, mono -sialylated and completely 
sialylated glycan structures, while, only the completely 
sialylated structure has been shown to be up-regulated 
in pancreatic cancer samples compared to the controls 
[7]. In addition to the changes in glycosylation, elevation 
of the haptoglobin protein levels in the serum and ascitic 
fluid of ovarian cancer patients has been reported in pre- 
vious studies [38,39]. Similarly, PON1 has been impli- 
cated in ovarian cancer and it was shown to be involved 
in a variety of enzymatic activities including lactonase, 
arylesterase, peroxidase, and phospholipase activities 
[34], It has been suggested that women who carry PON1 
alleles associated with reduced PON1 activity and those 
with lower concentrations might be at higher risk for 
developing ovarian cancer [40]. It was also reported that 
PON1 genetic variants may alter the risk of epithelial 
ovarian cancer among smokers and obese women. Zinc- 
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alpha-2 -glycoprotein has been proposed as one of the 
biomarkers for predicting the response to thalidomide- 
based therapy in newly diagnosed multiple myeloma 
patients [41]. It is also a potential molecular marker for 
biochemical relapse in men with margin-positive, loca- 
lized prostate cancer [42], and differential regulation of 
this protein has been linked to gastrointestinal cancer 
[43], prostate cancer [44], cancer cachexia [45], and 
bladder cancer [46]. However, no glycosylation aberra- 
tion was reported for this protein and linked to any can- 
cer including ovarian cancer. On the contrary, we found 
significant increase in sialylation in our deglycosylated 
peptide and western blot analysis of SNA enriched ovar- 
ian cancer serum samples. Our data suggests that glyco- 
sylation aberrations, as assessed at the glycopeptides 
level, are a highly sensitive and specific tool for bio- 
marker discovery in various cancers. 

Significant sialylation aberrations were observed in 
ovarian cancer when we compared our quantitative 
results on the sialylated glycopeptides corresponding to 
the glycoproteins reported in the literature. Similar to 
our findings, changes in glycosylation have been 
reported for alpha- 1 -antitrypsin glycoprotein in ovarian 
cancer patients [47]. Glycosylation changes in serotrans- 
ferrin have not been reported earlier in ovarian cancer. 
However, ovarian cancer cells have been shown to se- 
crete serotransferrin in chemically defined medium and 
up-regulation of serum levels of four isoforms of sero- 
transferrin have been observed in 42% of cancer patients 
after six cycles of chemotherapy [48]. In Our study in 
which significant changes in serotransferrin sialylation 
were observed in multiple glycopeptides in ovarian can- 
cer (Table 1), along with published reports suggesting 
serotransferrin could be a potential biomarker for ovar- 
ian cancer at both glycoprotein and glycopeptides level. 
Furthermore, Liu et al. [49], reported significant sialyla- 
tion changes in TLNQSSDELQLLSMGNAMFVK glyco- 
peptide and no change in the LSLGAHNTTLTEILK 
glycopeptide, from alpha- 1-antichymotryspin glycopro- 
tein in ovarian cancer. In our analysis, we identified a 
different sialylated peptide, YTGNASALFILPDQDK 
from alpha- 1-antichymotryspin with no change in 
silaylation. 

Evaluation of individual vs pooled samples of various 
types and stages of ovarian cancer by western blot analysis 
revealed no differences at the glycoprotein level suggesting 
there is minimal loss of significance in using pooled sam- 
ples for discovery of biomarkers. However, extensive vali- 
dations have to be performed with significant number of 
individual samples prior to the selection of potential bio- 
markers for diagnostics development. In general, our data 
is in accordance with previously reported studies and 
emphasizes that the sialylation status of the glycopro- 
teome in cancer is highly dependent on the type of 



samples compared and disease status and is independent 
of the concentration of the parent glycoproteins. 

We recently reported the sialylation aberrations in 
prostate cancer serum samples by employing the LTL 
method and unambiguously identified and quantitated 
45 N-linked sialylated glycopeptides. In this study, we 
compared the N-linked sialylated glycopeptidome of 
prostate cancer serum samples with ovarian cancer data. 
The results of the comparative analysis of all the glyco- 
peptides identified in these two cancers are given in 
Table 3. Interestingly, a majority of the glycopeptides 
identified in prostate cancer was also identified in ovar- 
ian cancer; however, we observed significant differences 
in the sialylation of most of these glycopeptides in these 
two cancers. While the sialylation of 6 glycopeptides 
increased in both cancers, the sialylation of 5 other gly- 
copeptides increased only in ovarian cancer. In addition, 
the sialylation is significantly elevated in 16 glycopep- 
tides only in prostate cancer suggesting the differential 
prevalence of sialylation aberration in various cancers. 
Although 28 glycopeptides were common to these two 
cancers, 17 glycopeptides were uniquely identified in 
ovarian cancer and 17 different glycopeptides were 
uniquely identified in prostate cancer (Figure 7). The 
data is intriguing as to the potential of glycosylation 
aberrations at the glycopeptide level as cancer specific 
biomarkers. 

Conclusions 

We successfully used LTL and iTRAQ quantitative pro- 
teomics methods in conjunction with western blot ana- 
lysis to investigate the sialylation aberration in N-linked 
glycopeptides in normal and ovarian cancer serum sam- 
ples. In the current study, we identified 30 glycoproteins 
with 45 N-linked sialylated glycopeptides comprising 46 
glycosylation sites. We observed increases in sialyation 
of 10 N-linked glycopeptides in ovarian cancer, which is 
independent of their protein concentrations as con- 
firmed by the non-glycosylated peptide analysis. Sialyla- 
tion change in some of the selected peptides was also 
confirmed by the western blot analysis of SNA enriched 
precursor glycoproteins. Overall, we observed a good 
correlation between the proteomic analysis and western 
blot experiments. These integrated results strongly suggest 
that haptoglobin, PON1 and zinc-alpha-2-glycoprotein 
may undergo specific sialylation changes at the glycoep- 
tide level in ovarian cancer. 

Further, the comparative analysis of N-linked sialylated 
glycopeptides from ovarian and prostate cancer serum 
samples revealed major variation in the level of sialyla- 
tion in these two cancers. The increased cancer specific 
sialylation in the N-glycopeptides in these two cancers 
also implies that the glycopeptides are potential targets 
for biomarker discovery. These preliminary observations 
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Table 3 List of differentially sialylated glycopeptides identified and quantitated in ovarian cancer and prostate cancer 
sera 

Glycopeptide sequence Heavy/Light ratio (peptide) 

Ovarian cancer Prostate cancer 



increase of sialylation in ovarian cancer and prostate cancer 






P00738 


Haptoglobin 


NLFLNHSENATAK 


2.0 


3.1 






WLHPNYSQVDIGLIK 


2.7 


3.8 


P00450 


Ceruloplasmin 


EHEGAIYPDNTTDFQR 


3.1 


3.9 


P02790 


Hemopexin 


ALPQPQNVTSLLGCTH 


2.1 


3.8 


P04114 


Apolipoprotein B-100 


fvegshnstvsl™ 


2.0 


2.9 


P01009 


Isoform 1 of Alpha-1 -antitrypsin 


YLGNATAIFFLPDEGK 


2.1 


5.5 


increase 


of sialylation only in ovarian cancer 








P27169 


Serum paraoxonase/arylesterase 1 


HANWTLTPLK 


4.0 


— 


P02787 


Serotransferrin 


CGLVPVLAENYNK 


7.6 


1.1 






QQQHLFGSNVTDCSGNFCLFR 


3.1 


— 


075882 


Attractin 


NHSCSEGQISIFR 


2.2 


— 


P25311 


Zinc-alpha-2-glycoprotein 


DIVEYYNDSNGSHVLQGR 


2.0 


— 


increase 


of sialylation only in prostate cancer 








P36980 


Complement factor H-related protein 1 


LQNNENNISCVER 


1.8 


2.2 


P 19823 


Inter-alpha-trypsin inhibitor heavy chain H2 


GAFISNFSMWDGK 


1.0 


4.2 


P08603 


Isoform 1 of Complement factor H 


IPCSQPPQIEHGTINSSR 


1.9 


3.5 






ISEENETOYMGK 


— 


2.8 






MDGASNVTCINSR 


1.0 


2.3 


P04004 


Vitronectin 


NNAWHEQVGGPSLTSDLQAQSK 


— 


2.4 


P02765 


Alpha-2-HS-glycoprotein 


AALAAFNAQNNGSNFQLEEISR 


1.6 


3.7 






KVCQDCPLLAPLNDTR 


1.1 


4.3 






VCQDCPLLAPLNDTR 


1.3 


2.4 


P02763 


Alpha-1 -acid glycoprotein 1 


QDQCIYNTTYLNVQR 


1.8 


2.1 


P00450 


Ceruloplasmin 


enltapgsdsavffeqg™ 


1.9 


3.1 






AGLQAFFQVQECNK 


0.4 


6.8 


P02790 


Hemopexin 


SWPAVGNCSSALR 


1.6 


4.4 


P27169 


Serum paraoxonase/arylesterase 1 


\^QVYAENGTVLQGSmSVYK 


— 


2.1 


P01042 


Isoform HMW of Kininogen-1 


ITYSIVQTNCSK 


1.6 


2.5 






LNAENNATFYFK 


1.2 


5.0 


P01871 


IGHM protein 


GLTFQQNASSMCVPDQDTAIR 




4.7 






YKNNSDISSTR 




2.3 


increase 


of sialylation in ovarian cancer and prostate cancer 






P00738 


Haptoglobin 


NLFLNHSENATAK 


2.0 


3.1 






WLHPNYSQVDIGLIK 


2.7 


3.8 


P00450 


Ceruloplasmin 


EHEGAIYPDNTTDFQR 


3.1 


3.9 


P02790 


Hemopexin 


ALPQPQNVTSLLGCTH 


2.1 


3.8 


P04114 


Apolipoprotein B-100 


fvegshnstvsl™ 


2.0 


2.9 


P01009 


Isoform 1 of Alpha-1 -antitrypsin 


YLGNATAIFFLPDEGK 


2.1 


5.5 



Swissprot Protein 
ID 
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Table 3 List of differentially sialylated glycopeptides identified and quantitated in ovarian cancer and prostate cancer 
sera (Continued) 

increase of sialylation only in ovarian cancer 



P27169 


Serum paraoxonase/arylesterase 1 


HANWTLTPLK 


4.0 




P02787 


Serotransferrin 


CGLVPVLAENYNK 


7.6 


1.1 






QQQHLFGSNVTDCSGNFCLFR 


3.1 




075882 


Attractin 


NHSCSEGQISIFR 


2.2 




P25311 


Zinc-alpha-2-glycoprotein 


DIVEYYNDSNGSHVLQGR 


2.0 




increase 


of sialylation only in prostate cancer 








P36980 


Complement factor H-related protein 1 


LQNNENNISCVER 


1.8 


2.2 


P 19823 


Inter-alpha-trypsin inhibitor heavy chain H2 


GAFISNFSM7VDGK 


1.0 


4.2 


P08603 


Isoform 1 of Complement factor H 


IPCSQPPQIEHGTINSSR 


1.9 


3.5 






ISEENETTCYMGK 




2.8 






MDGASNVTCINSR 


1.0 


2.3 


P04004 


Vitronectin 


N N ATVH EQVGG P S LTS D LQ AQS K 




2.4 


P02765 


Alpha-2-HS-glycoprotein 


AALAAFNAQNNGSNFQLEEISR 


1.6 


3.7 






KVCQDCPLLAPLNDTR 


1.1 


4.3 






VCQDCPLLAPLNDTR 


1.3 


2.4 


P02763 


Alpha-1-acid glycoprotein 1 


QDQCIYNTTYLNVQR 


1.8 


2.1 


P00450 


Ceruloplasmin 


enltapgsdsavffeqg™ 


1.9 


3.1 






AGLQAFFQVQECNK 


0.4 


6.8 


P02790 


Hemopexin 


SWPAVGNCSSALR 


1.6 


4.4 


P27169 


Serum paraoxonase/arylesterase 1 


^QVYAENGTVLQGSWASVYK 




2.1 


P01042 


Isoform HMW of Kininogen-1 


ITYSIVQTNCSK 


1.6 


2.5 






LNAENNATFYFK 


1.2 


5.0 


P01871 


IGHM protein 


GLTFQQNASSMCVPDQDTAIR 




4.7 






YKNNSDISSTR 




2.3 



List of differentially sialylated glycopeptides identified and quantitated in ovarian cancer and prostate cancer sera by using LTL quantitative proteomics method. 

were further confirmed by screening individual samples 
of various types and stages of ovarian cancer by western 
blot analysis, and the current data clearly demonstrate 
that the sialylation changes are unique to each N-linked 
glycopeptide glycosylation in a particular type of cancer 
and this approach may have a high potential for the de- 
velopment of early diagnostic strategies for cancer in 
general 

Methods 

Serum samples 

Serum samples were procured from ten patients with 
histologically confirmed ovarian carcinoma (primary 
diagnosis, regardless of stage of the disease), prior to 
surgery and chemotherapy. Patient samples were 
obtained under IRB approved protocols from patients 
undergoing treatment at Duke University Medical Cen- 
ter. Equal number (4 each) of serous adenocarcinoma, 
endometroid adenocarcinoma and clear cell carcinoma 




Ovarian 
Cancer 



Prostate 
Cancer 



Figure 7 Venn diagram showing the overlap and unique 
desialylated peptides identified in ovarian cancer sera and 
prostate cancer sera. 
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ovarian cancer samples were used. Serum samples from 
five age matched healthy female control individuals were 
purchased from Research Blood Components, LLC 
(Brighton, MA). Equal amounts of undiluted serum from 
each cohort were used to generate composites consisting 
of cancer and normal serum samples for the study. 

Chemicals and reagents 

HPLC grade acetonitrile (ACN) and water were obtained 
from Burdick and Jackson (Muskegon, MI). Methanol, cal- 
cium chloride (CaCl 2 ) and sodium phosphate (NaH 2 P0 4 ) 
were purchased from EMD chemical Inc. (Gibbstown, 
NJ). Ethanol was purchased from Decon Labs, Inc. (King 
of Prussia, PA). Acetic anhydride ( 1 H 6 / 2 D 6 ), H 2 s O water, 
phenylmethanesulfonyl fluoride (PMSF), ammonium bi- 
carbonate (AB), formic acid (FA), trifluoroacetic acid 
(TFA), dithiothreitol (DTT), iodoacetamide (IA), manga- 
nese chloride (MnCl 2 ), lactose were purchased from 
Sigma (St. Louis, MO). Sambucus nigra lectin (SNA) agar- 
ose was purchased from Vector Laboratories (Burlingame, 
CA). Spin-x columns were purchased from Corning, Inc. 
(Lowell, MA). Peptide-A/-glycosidase F (PNGase F) was 
purchased from QA BIO (Palm Desert, CA). Protein A/G 
beads and C-18 miniprep columns were purchased from 
Thermo Scientific (Waltham, MA). RapiGest SF, vacuum 
manifold and C-18 sep-pak cartridges were obtained from 
Waters (Milford, MA). Tris, NaCl and NaOH were pur- 
chased from Fisher (Pittsburgh, PA). PBS was purchased 
from Mediatech, Inc. (Manassas, VA). Ultra Centricon 3 
kDa filters were purchased from Millipore (Billerica, MA). 
Trypsin was purchased from Promega (Madison, WI). 
iTRAQ labeling kit was purchased from Applied Biosys- 
tems (Bedford, MA). 



In-solution digestion 

Prior to the in-solution digestion, serum samples were 
pre-cleared with spin-x columns and IgG was removed 
using protein A/G beads and buffer exchanged with 50 
mM AB using Ultra Centricon 3 kDa filters. The protein 
concentrations of normal and cancer serum samples 
were calculated by nanodrop technologies instrument 
(Wilmington, DE). 

Briefly, the protein solution was mixed with 50 ul of 
50 mM AB containing 0.1% of Rapigest SF and the pro- 
tein was reduced with DTT (5 um/ul in 50 mM AB) by 
incubating the mixture at 65°C for 45 min and alkylated 
with IA (15 |iin in 50 mM AB) by incubating the reac- 
tion mixture in dark for 30 min. Then the alkylated gly- 
coproteins were digested by trypsin (5 ng/ul in 50 mM 
AB) overnight at 37°C in a water bath. RapiGest was 
removed according to the vendors recommended 
protocol. 



Enrichment of glycopeptides using SNA lectin 

Tryptic peptides were mixed with 1 mM solution of 
PMSF prior to the glycoprotein enrichment by SNA lec- 
tin. The tryptic peptide sample was diluted to 4:1 with 5 
x binding buffer (100 mM Tris, 750 mM NaCl, 5 mM 
CaCl 2 and 5 mM MnCl 2 , pH 6.7). The lectin columns 
were prepared by adding 200ul of SNA lectin to the car- 
tridges and spun down at 1000 x g for 1 min to remove 
storage buffer. The lectin resin in columns was washed 
with 200 ul (2 x) of 1 x washing buffer (20 mM Tris, 
150 mM NaCl, ImM CalCl 2 and 1 mM MnCl 2 ). Then 
the sample was loaded onto the columns, mixed and 
incubated at room temperature (RT) for 10 min. The 
columns were centrifuged to collect the flow through 
and washed with 400 ul of 1 x washing buffer (2 x). 
Again, 400 ul of 1 x washing buffer was added and incu- 
bated for 5 min at RT (2 x). The flow through from 
samples was saved for additional labeling experiments. 
Finally, the glycopeptides were eluted from the lectin 
columns by adding 200 ul (2 x) of elution buffer (20 
mM Tris, 150 mM NaCl, ImM CalCl 2 , 1 mM MnCl 2 
and 200 mM lactose) and incubating for 10 min at RT. 
The flow through containing the glycopeptides were 
collected by centrifugation and purified by C-18 
chromatography. 

Acetylation of glycopeptides 

Glycopeptides obtained from normal and ovarian cancer 
serum samples and non-glycosylated serum peptides 
(lectin column flow through) were labeled with acetyl 
( 1 H 3 / 2 D 3 ) groups according to a published protocol [50]. 
Briefly, 2 ul of 1 H 6 -acetic anhydride or 2 D 6 -acetic anhyd- 
ride [50% (vol/vol) in methanol] were added to 100 ul of 
peptide mixture in 50% methanol/water (vol/vol). The 
mixture was allowed to react at 22°C for 15 min. The re- 
action was stopped by the addition of 2.2 ul of formic 
acid and equal aliquots of both samples were mixed. 

PNGase F digestion of glycopeptides 

The light (normal) and heavy (cancer) N-terminus/lysine 
labeled serum glycopeptides were mixed and digested 
with PNGase F to cleave the N-glycans from peptides 
according to the protocol described elsewhere [51]. 
Briefly, the glycopeptides were dried in vacuum and re- 
dissolved in 100 mM NaH 2 P0 4 buffer (pH 7.5). The 
PNGase F enzyme was dissolved in 50 mM AB. 
NaH 2 P0 4 buffer solution and AB buffer solution were 
prepared in H 2 s O before dissolving glycopeptides and 
enzyme, respectively. The reaction mixture was incu- 
bated with 6 ul of PNGase F at 37°C overnight. The re- 
action was stopped by adding 0.5% TFA solution. The 
resulting N-deglycosylated peptides were purified by C- 
18 chromatography. 
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Purification of peptides by C-18 chromatography 

The trypsin digested peptides, SNA enriched glycopep- 
tides and N-deglycosylated peptides were purified by C- 
18 reversed-phase (RP) chromatography using C-18 spin 
columns with the help of a vacuum manifold. Briefly, the 
C-18 columns were activated with 50% acetonitrile and 
equilibrated with a buffer containing 2% ACN and 0.1% 
TFA in water. Then peptide mixture (dissolved in 2% 
ACN and 0.1% TFA in water) was slowly loaded into the 
C-18 cartridge. Then the columns were washed thor- 
oughly with 0.1% TFA to remove salts and buffers. Fi- 
nally, the tryptic peptides were eluted with 70% 
acetonitrile twice (250 ul each), the two eluted fractions 
were combined and concentrated. The serum glycopep- 
tide mixture was fractionated by C-18 RP column (4.6 
mm diameter x 150 mm length) using an offline ultim- 
ate 3000 HPLC (Dionex, Sunnyvale, CA). Mobile phase 
A was 2% acetonitrile (ACN) and 0.1% formic acid (FA) 
in water, while mobile phase B was 0.1% FA and 90% 
ACN in water. Peptides were then eluted from the column 
with an 80 min linear gradient from 5 to 80% buffer B at 
a flow rate of 200uL/min. A total of 35 fractions were 
collected and each fraction was concentrated to 6 ul 
under vacuum. 

iTRAQ labeling of non-glycosylated peptides 

Peptide mixtures were labeled using iTRAQ reagents 
following the vendors recommended protocol. Briefly, 
each vial of iTRAQ Reagent was allowed to reach room 
temperature and spun to bring the solution to the bot- 
tom of the tube. 70 uL of ethanol was added to each 
iTRAQ reagent vial. Then the solution in each vial was 
vortexed to mix and then spun. The contents of one 
iTRAQ reagent vial were transferred to one sample tube. 
Each tube containing the reaction mixture was vortexed 
to mix and then spun. Finally, the reaction mixtures 
were incubated at room temperature for 1 hour and 
then combined and processed further. 

Purification and fractionation of iTRAQ labeled 
non-glycosylated peptides by SCX chromatography 

The iTRAQ labeled peptides mixture was purified and 
fractionated by an off-line HPLC using SCX analytical col- 
umn (PolySULFOETHYL A, 200x4.6mm; 5um, 300°A) 
(Poly LC Inc.). First, the concentrations of buffer salts and 
organics in iTRAQ labeled peptide solution were reduced 
by diluting the sample mixture by 10 fold with cation ex- 
change buffer- A (10 mM potassium phosphate (KH 2 P0 4 ) 
in 25% acetonitrile at pH 3.0). The sample mixture was 
reconstituted with 1 mL cation exchange buffer-load and 
vortexed to mix. An aliquot was removed and the pH was 
adjusted by adding 1M HC1 to bring it down to pH ~3. To 
remove the impurities from the peptide mixture, the gra- 
dient kept at 5% B in the first 20min and did not collect 



any fractions. Then the peptides were eluted by buffer-B 
(10 mM KH 2 P0 4 in 25% acetonitrile/350 mM potassium 
chloride (KC1) at pH 3.0) with the gradient of 0% to 95% B 
in 70 min and 95% to 95% B from 70 min to 80 min at a 
flow rate of 200uL/min. 

Mass spectrometry analysis 

A 3000 nano ultimate HPLC (Dionex, Sunnyvale, CA) 
was coupled with LTQ mass spectrometer (Thermo 
Electron, San Jose, CA) equipped with advanced nanos- 
pray source to analyze acetyl ( 1 H 3 / 2 D 3 ) and ls O labeled 
peptides. The serum N-deglycosylated peptide fractions 
were injected into LC-MS/MS system to identify degly- 
cosylated peptides. As a part of on-line sample clean-up 
step, the peptides were first concentrated in a C-18 RP 
trap column (Dionex) and then separated by using a 
75 um ID x 15cm C-18 RP analytical column (Dionex, 
Sunnyvale, CA) equilibrated in 4% ACN/0.1% FA at 250 
nL/min flow rate. Mobile phase A was 2% ACN and 
0.1% FA in water, while mobile phase B was 0.1% FA 
and 90% ACN in water. Peptides were separated with a 
4% to 50% B in 60 min and 50% to 80% in 90 min and 
eluted directly into an LTQ MS. The mass range in MS 
mode was 350 Da to 1800 Da and in MS/MS mode it 
was set as 100 Da to 2000 Da. N-deglycosylated peptides 
were analyzed by normal data dependent mode method 
in which the instrument was set to acquire fragment ion 
(MS/MS) spectra on the 4 most abundant precursor ions 
from each MS scan with a repeat count set of 1 and dur- 
ation of 30 sec. Dynamic exclusion was enabled for 180 
sec and the collision energy was 33. In a separate experi- 
ment, serum non-glycosylated peptide mixtures were 
analyzed by an LC-MS/MS experiment (in triplicate) 
with a long HPLC linear gradient (4% to 50% B in 120 
min and 50% to 80% in 180 min). 

The iTRAQ labeled non-glycosylated peptides were ana- 
lyzed by data dependent nano LC-MS/MS experiments on 
an Velos LTQ-Orbtrap mass spectrometer (Thermo 
Fisher) interfaced with a nano ultimate HPLC (Dionex). 
The sample was loaded onto a trap column of 100 um ID 
X 2 cm (L) packed with 5-um Magic C18 AQ (200 A, 3 
um, Michrom) and washed using 98% H 2 0, 2% ACN, 
0.05% TFA buffer at a flow rate of 10 ul/min for 5 min. 
The peptides were then separated by a self-packed 75um 
ID X 50 cm (L) fused silica column packed with 3-um 
Magic C18 AQ (200 A, 3 um, Michrom) using a linear 
gradient of Buffer B from 4% to 55% in 50 min at a flow 
rate of 300 nl/min. The buffer compositions were as fol- 
lows: Buffer A was 0.1% formic acid in water, buffer B was 
0.1% formic acid, 80% acetonitrile. The analytical column 
was coupled to the mass spectrometer via a nano ion 
source (Proxeon) with a metal emitter. The peptides were 
analyzed in the Orbitrap operated at 60,000 resolution in 
full scan (300-2000m/z) followed by 10 Data-Dependent 
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CID MS/MS scans (100-2000m/z) with 7,500 resolution 
and normalized collision energy of 45%. Survey scans were 
acquired in profile mode and MS/MS scans were acquired 
in centroid mode. Maximum injection times for MS and 
MS/MS were set to 500 and 1000ms, respectively. The 
precursor isolation width was set at ±1.2 Da and monoiso- 
topic precursor selection was enabled to exclude singly 
charged ions from MS/MS. The minimum intensity 
threshold for MS/MS fragmentation in the Orbitrap 
analyzer was 5000 counts and the dynamic exclusion was 
set to 60 sec with repeat count as one. 

Glycopeptide identification and quantitation 

The raw data were converted into DTA files using Bio- 
works 3.1 software (Thermo electron, San Jose, CA). All 
the 'DTA files were merged and converted into a single 
text file using an in-house software program. The pro- 
teins were identified by searching this text file of tandem 
mass spectrometry data in Swissprot human database 
with mascot 2.0 software (MatrixScience, London, U.K). 
The database search parameters were: mass type- 
monoisotopic precursor and fragment, enzyme-trypsin, 
threshold- 100, peptide tolerance- 1.5 Da, and fragment 
ion tolerance- 1.0. Variable modifications: N-terminus- 
acetyl ( 1 H 3 / 2 D 3 ), C-carbamidomethylation; K-acetyl 
( 1 H 3 / 2 D 3 ), N-deamidation (3 Da), M-oxidation, N- 
PNGase ls O. The search results, as well as the raw tan- 
dem mass spectrometry data were further verified 
manually and all glycopeptide sequences were identified 
with high confidence based on the presence of NXT/S 
motif and a 3 Da mass shift in b or y ions due to the 
modification at the N-terminus or K, respectively, by 
heavy acetyl ( 2 D 3 ) group. 

The isotope ratios of light and heavy forms of glyco- 
peptides were calculated manually by Xcalibur software 
(Thermo Electron, San Jose, USA) using peak areas 
derived from the XICs and dividing the area of the 
heavy isotope by the area of the light. Similarly, acetyl 
( 1 H 3 / 2 D 3 ) and ls O labeled glycopeptides in the serum 
analysis were quantified manually by using Xcalibur soft- 
ware program In case of multi-peptide protein identifi- 
cation, quantification was performed for at least one 
glycopeptide of each modification (acetyl ( 1 H 3 / 2 D 3 ) 
+ 18 0/acetyl ( 1 H 3 / 2 D 3 )+ ls O+M-oxdation/acetyl CH 3 / 
2 D 3 )+ ls O+esterification). The ratios of the light and 
heavy isotope-labeled peptides were determined by cal- 
culating the peak areas found in the corresponding 
extracted ion chromatograms (XICs) of deglycosylated 
peptides obtained with 1 Da mass window. These pep- 
tide ratios were also confirmed by the relative intensities 
of multiply charged parent peptide peaks in the MS 
spectrum. Mean ratios were also calculated for the pep- 
tides identified with multiple modifications as light and/ 
or heavy forms. Final protein ratios were calculated by 



averaging the peptide ratios of all modifications. Since 
there is only a 3 Da mass difference between differen- 
tially labeled deglycosylated peptides and overlapping of 
light and heavy isotopic envelops could introduce small 
errors in the quantitation, we considered sialylation is 
increased in a peptide only if heavy/light ratio of that 
peptide is > 2 fold. 

Protein identification and quantitation based on the non- 
glycosylated peptide analysis 

Glycopeptides and their corresponding proteins were 
identified by searching the LC-MS/MS raw data using 
proteome discoverer software (v 1.3) with Sequest search 
algorithm (Thermo). The data generation and database 
search parameters were: m.wt range: 375 Da- 1500 Da, 
threshold: 200 counts, charge state— auto, MS n level— 
MS 2 , activation type— HCD, precursor ion tolerance: 20 
ppm, fragment ion tolerance - 0.02 Da, minimum ion 
count: 10, in databse - Swissprot human, oxidation of M: 
16 Da, N-terminus/K modification by iTRAQ reagents: 
144 Da, enzyme - trypsin. The search results were also 
verified manually to confidently identify the correct pep- 
tide sequence. Quantitation was done by selecting a 4- 
plex iTRAQ default method which contains the isobaric 
correction parameters for all the iTRAQ labels, however, 
only 3 labels (114,115,116) were selected for quantitation 
of non-glycosylated peptides. Peptides quantified with > 
1.5 fold increase and with < 1.5 fold decrease in concen- 
tration were considered as up regulated and down regu- 
lated, respectively, in the current analysis. 

Western blot analysis 

Validation of differentially sialylated glycoproteins 

For each selected protein, 10 ug of each of the indicated 
serum samples were resolved on a 10% Tris-Glycine gel 
and the proteins were transferred onto PVDF membrane 
using a wet transfer apparatus. Various proteins in the 
serum (Haptoglobin, PON1, Zinc Alpha 2 glycoprotein) 
were probed with a 1:1000 dilution of commercial anti- 
bodies in Odyssey blocking buffer + 0.1% Tween 20 
[Sigma, catalog #H8636 (anti-Haptoglobin); Abeam, 
catalog #ab24261 (anti-PONl); Abeam, catalog 
#ab47116 (anti-zinc alpha 2 glycoprotein)] followed by a 
1:5000 dilution of secondary reagents in Odyssey block- 
ing buffer + 0.1% Tween 20 [Licor Biosciences, catalog 
#926-32210 (anti-mouse IgG IRDye800CW) or catalog 
#926-32211 (anti-rabbit IgG IRDye 800CW)]. Blot was 
developed using fluorescence detection on the Odyssey 
Infrared Imaging System from Licor Biosciences. 

Individual patient sample analysis 

Western blot analysis was also performed on individual 
samples obtained from serous, endometroid and clear 
cell carcinoma of stage I and Stage II-IV ovarian cancer 
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patients. These samples were subjected to SNA lectin 
enrichment and probed with PON1 or haptoglobin anti- 
bodies. Pooled stage I and stage II-IV samples were also 
subjected to western blot experiments where pooled 
healthy female sample used as control. Serum samples 
were analyzed individually or pooled (Before) and lOOuL 
of these were enriched using SNA Lectin as per the 
product protocol (Vector Labs AL-1303) (After). Follow- 
ing enrichment, approximately 300ug were taken and 
treated with PNGase in the presence of a lx protease in- 
hibitor cocktail and 5% NP40 for 2 hours at 37°C. Fol- 
lowing treatment with SNA, the protein concentrations 
were determined using the BCA Assay. Samples were 
loaded on a 10% TrisGlycine gel as follows - for PON1, 
5ug of the Before samples and 2.4ug of the After sam- 
ples were loaded/ well; for Haptoglobin, lug of each sam- 
ple was loaded/well, and the proteins were transferred 
onto PVDF membrane using a wet transfer apparatus. 
The membranes were blocked using Odyssey Blocking 
Buffer and probed with Haptoglobin (Sigma H8636) and 
PON1 (abeam ab24261) antibodies, followed by a corre- 
sponding IRDye secondary antibody (Licor Biosciences 
#926-32210 or #926-68021). The blot was developed using 
the Odyssey Infrared Imaging System (Licor Biosciences). 

Investigation of O-sialylated glycoprotein contribution 

Glycoproteins were enriched from lOOuL each of the 
pooled serum samples using SNA Lectin as per the 
product protocol (Vector Labs AL-1303). Following en- 
richment, approximately 300ug of the sample was trea- 
ted with PNGase in the presence of a lx protease 
inhibitor cocktail and 5% NP40 for 2 hours at 37°C. 
After treatment with SNA and/or PNGase, the protein 
concentrations were determined using the BCA Assay. 
Ten microgram of the sample was loaded/ well on a 10% 
TrisGlycine gel and the proteins were transferred onto 
PVDF membrane using a wet transfer apparatus. The 
membranes were blocked using Odyssey Blocking Buffer 
and probed with Haptoglobin (Sigma H8636) antibodies 
followed by a corresponding IRDye secondary antibody 
(Licor Biosciences #926-32210 or #926-68021). The blot 
was developed using the Odyssey Infrared Imaging Sys- 
tem (Licor Biosciences). 
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